Modulation Cepstrum Discriminating between Speech and Environmental Noise

Authors

Tooru Miyoshi

Takahito Goto

Takaaki Doi

Taeko Ishida

Takayuki Arai

Yuji Murahara

Abstract

We introduce the "modulation cepstrum," a novel representation of an acoustic signal used to distinguish speech signals from environmental noise. The modulation cepstrum is computed by taking the inverse Fourier transform of the logarithmic modulation spectrum, which is a spectral representation of the temporal dynamics of a sub-band. As an index, we calculated the center of gravity of the modulation cepstrum accumulated over eight seconds of an acoustic signal. The experimental results showed that this index enabled us to discriminate between speech and noise signals.
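The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the filter bank, the frame parameters, or how the cepstrum is accumulated. Here the sub-band envelopes come from pooled STFT magnitudes, the accumulation is a sum of absolute cepstral values over sub-bands, and all function names and parameters (`subband_envelopes`, `n_bands`, `hop`, and so on) are hypothetical choices made for illustration only.

```python
import numpy as np


def subband_envelopes(x, fs, n_fft=256, hop=80, n_bands=8):
    """Crude sub-band envelopes (assumption: STFT magnitudes pooled
    into equal-width frequency bands; the paper's filter bank is unknown)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop:i * hop + n_fft] * window for i in range(n_frames)]
    )
    mag = np.abs(np.fft.rfft(frames, axis=1))        # (n_frames, n_bins)
    bands = np.array_split(mag, n_bands, axis=1)     # split along frequency
    env = np.stack([b.mean(axis=1) for b in bands])  # (n_bands, n_frames)
    return env, fs / hop                             # envelope sampling rate


def modulation_cepstrum(envelope, eps=1e-10):
    """Inverse Fourier transform of the log modulation spectrum."""
    mod_spectrum = np.abs(np.fft.fft(envelope))      # modulation spectrum
    log_spectrum = np.log(mod_spectrum + eps)        # eps avoids log(0)
    return np.real(np.fft.ifft(log_spectrum))


def center_of_gravity_index(x, fs):
    """Center of gravity (in seconds of quefrency) of the accumulated
    modulation cepstrum. Accumulating absolute values over sub-bands
    is an assumption, not a detail taken from the paper."""
    env, env_fs = subband_envelopes(x, fs)
    accumulated = np.zeros(env.shape[1])
    for band_envelope in env:
        accumulated += np.abs(modulation_cepstrum(band_envelope))
    half = len(accumulated) // 2                     # cepstrum is symmetric
    quefrency = np.arange(1, half) / env_fs          # skip quefrency 0
    weights = accumulated[1:half]
    return float(np.sum(quefrency * weights) / np.sum(weights))


if __name__ == "__main__":
    fs = 8000
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(8 * fs)              # eight seconds of noise
    print(center_of_gravity_index(noise, fs))
```

In practice the index would be computed for both speech and noise recordings and compared against a threshold; the threshold value and the direction of the comparison would have to be determined from data, since the abstract reports neither.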

Similar resources

Leveraging Jointly Spatial, Temporal and Modulation Enhancement in Creating Noise-robust Features for Speech Recognition

This paper proposes adopting various fusions of spatial-, temporal-, and modulation-domain speech feature enhancement techniques to achieve superior speech recognition performance in noise-corrupted environments. With the mel-frequency cepstral coefficients (MFCC) as the standard speech feature representation, the spatial-domain techniques involve the short-time intra-frame featur...

AM-FM Based Robust Speaker Identification in Babble Noise

Speech babble is one of the most challenging types of noise interference for speech and speaker recognition systems because of its speaker- and speech-like characteristics. The performance of such systems degrades strongly in the presence of background noise such as babble. Existing techniques address this problem with additional processing of the speech signal to remove noise. In contrast to existing works, the aim...

Robust Speech Recognition with MSC/DRA Feature Extraction on Modulation Spectrum Domain

This report introduces noise-robust speech recognition and proposes advanced speech analysis techniques named MSC (Modulation Spectrum Control) and DRA (Dynamic Range Adjustment). The dynamic range of the cepstrum obtained from noisy speech is usually smaller than that obtained from the same speech without noise, since some speech features are hidden in the noise. This difference may cause recognition errors. Theref...

Fusion of Acoustic, Perceptual and Production Features for Robust Speech Recognition in Highly Non-stationary Noise

Improving the robustness of speech recognition systems against adverse background noise is a challenging research topic. Extraction of noise-robust acoustic features is one of the prominent methods for building robustness into speech recognition systems. Prior studies have proposed several perceptually motivated noise-robust acoustic features, and the normalized modulation cepstral...

Physiologically Motivated Feature Extraction for Robust Automatic Speech Recognition

In this paper, a new method is presented for extracting robust speech features in the presence of external noise. The proposed method, based on two-dimensional Gabor filters, takes into account the spectro-temporal modulation frequencies and also limits redundancy at the feature level. The performance of the proposed feature extraction method was evaluated on isolated speech words which are ext...

Publication date: 2002
